General method to unravel ancient population structures through surnames, final validation on Italian data.
Abstract
We analyze the geographic location of 77,451 different Italian surnames (17,579,891 individuals) obtained from the lists of telephone subscribers for the year 1993. Using a specific neural network analysis (Self-Organizing Maps, SOMs), we automatically identify the geographic origin of 49,117 different surnames. To validate the methodology, we compare the results to a previous study conducted on the same database with accurate supervised methods. The two sets of results overlap by 97%, meaning that the SOM methodology is highly reliable and accurately traces the geographic origin of surnames back to the time of their introduction (Late Middle Ages/Renaissance in Italy). The SOM results enable one to distinguish monophyletic surnames from polyphyletic ones, that is, surnames with a single geographic and historic origin from those that came into use, with an identical spelling, in different locations (respectively, 76.06% and 21.05% of the total). As we are interested in geographic origins, polyphyletic surnames are excluded from further analyses. By comparing the present location of each monophyletic surname to its inferred geographic origin in the Late Middle Ages/Renaissance, we measure the extent of the migrations that have occurred in Italy since that time. We find that the percentage of individuals presently living in the very area where their surname came into use centuries ago is extremely variable (ranging from 22.77% to 77.86% according to the province), meaning that self-assessed regional identities seldom correspond to the "autochthony" they imply. For example, the upper part of the Tyrrhenian coast (Northern Latium, Tuscany) has a strong identity but few "autochthonous" inhabitants (∼28%), having been a passageway from the North to the South of Italy.
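The per-province percentage described in the abstract can be illustrated with a minimal sketch. It assumes, hypothetically, that surname counts are available per province and that each monophyletic surname has already been assigned a single province of origin (the step the authors perform with SOMs, which is not reproduced here). The function name `autochthony_share`, the data layout, and the toy surnames and provinces are illustrative assumptions, not taken from the paper, which may define the "area" of origin differently.

```python
from collections import defaultdict


def autochthony_share(counts, origin):
    """Percentage of individuals per province who live where their surname originated.

    counts: {(surname, province): number_of_individuals}  (hypothetical layout)
    origin: {surname: province_of_origin}, monophyletic surnames only
    """
    local = defaultdict(int)   # individuals living in their surname's province of origin
    total = defaultdict(int)   # all individuals bearing a monophyletic surname
    for (surname, province), n in counts.items():
        if surname not in origin:
            continue  # polyphyletic or unclassified surnames are excluded
        total[province] += n
        if origin[surname] == province:
            local[province] += n
    return {p: 100.0 * local[p] / total[p] for p in total}


# Toy data, invented for illustration only.
counts = {
    ("Alfa", "P1"): 60,
    ("Alfa", "P2"): 20,
    ("Beta", "P1"): 40,
    ("Beta", "P2"): 20,
    ("Gamma", "P1"): 100,  # treated as polyphyletic: no entry in `origin`
}
origin = {"Alfa": "P1", "Beta": "P2"}

shares = autochthony_share(counts, origin)
print(shares)
```

In this toy case, province P1 hosts 100 bearers of monophyletic surnames, 60 of whom carry a surname that originated there, so its share is 60%. A low share in a real province would correspond to the situation the abstract reports for the upper Tyrrhenian coast.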
Similar resources
Surname lists to identify South Asian and Chinese ethnicity from secondary data in Ontario, Canada: a validation study
BACKGROUND Surname lists are useful for identifying cohorts of ethnic minority patients from secondary data sources. This study sought to develop and validate lists to identify people of South Asian and Chinese origin. METHODS Comprehensive lists of South Asian and Chinese surnames were reviewed to identify those that uniquely belonged to the ethnic minority group. Surnames that were common i...
Surnames and ancestry in Brazil
This paper presents a method for classifying the ancestry of Brazilian surnames based on historical sources. The information obtained forms the basis for applying fuzzy matching and machine learning classification algorithms to more than 46 million workers in 5 categories: Iberian, Italian, Japanese, German and East European. The vast majority (96.7%) of the single surnames were identified usin...
Surnames and Y-Chromosomal Markers Reveal Low Relationships in Southern Spain
A sample of 416 males from western and eastern Andalusia has been jointly analyzed for surnames and Y-chromosome haplogroups and haplotypes. The observed number of different surnames was 222 (353 when the second surname of the Spanish system of naming is considered). The great majority of recorded surnames have a Castilian-Leonese origin, while Catalan or Basque surnames have not been found. A ...
The Surname Space of the Czech Republic: Examining Population Structure by Network Analysis of Spatial Co-Occurrence of Surnames
In the majority of countries, surnames represent a ubiquitous cultural attribute inherited from an individual's ancestors and predominantly only altered through marriage. This paper utilises an innovative method, taken from economics, to offer unprecedented insights into the "surname space" of the Czech Republic. We construct this space as a network based on the pairwise probabilities of co-occ...
Italian network for human biomonitoring of metals: preliminary results from two Regions.
The Italian program for human biomonitoring (HBM) of chemical elements, PROgram for Biomonitoring of the Exposure (PROBE), started in 2008 with the aim to provide the knowledge about risk assessment of the Italian population following the environmental exposure to metals. The project is implemented through a HBM campaign for the production of data on 19 metals in the blood and serum of subjects...
Journal: Human Biology
Volume 84, Issue 3
Publication date: 2012